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Abstract — In this paper, we determine the appropriate decay 
function for item-based collaborative filtering (CF). Instead of 
intuitive deduction, we introduce the Similarity-Signal-to-Noise- 
Ratio (SSNR) to quantify the impacts of rated items on current 
recommendations. By measuring the variation of SSNR over 
time, drift in user interest is well visualized and quantified. 
Based on the trend changes of SSNR, the piecewise decay 
function is thus devised and incorporated to build our time-aware 
CF algorithm. Experiments show that the proposed algorithm 
strongly outperforms the conventional item-based CF algorithm 
and other time-aware algorithms with various decay functions. 

I. Introduction 

On the rapidly changing Internet, user interest constantly 
changes over time, which presents a unique challenge for 
practical recommender systems. To cope with this, time- 
aware collaborative filtering (CF) is proposed as a solution to 
provide timely recommendations by exploiting the temporal 
information in rating^ data [1][2], such as the date the rating 
is generated. A straightforward and low-complexity scheme is 
to incorporate, in the framework of conventional CF, time- 
dependent weights in accounting the influence of ratings 
with different ages. As rating influence generally decreases 
with age, decay functions are introduced to weigh ratings 
[1]. Though benefits from time-awareness are observed in 
several experiments [1][2], the forms of decay function are 
still determined by intuitive deduction because of limited 
understanding on temporal data. In this paper, we identify and 
verify the appropriate form of decay function by introducing 
a novel measure to quantify the influence of ratings, which is 
the most crucial issue to understand and better utilize temporal 
rating data. The major contributions are two-fold: 

- Revealing dynamics of individual rating behavior. A new 
index, the Similarity-Signal-to-Noise-Ratio (SSNR), is 
introduced to quantify the impact of one's past ratings 
on his/her current favorite. By measuring the SSNR on 
a real dataset with precision in one second, we provide 
a clear picture of the rating impact variation over time. 
Specifically, we observe three distinct phases in the time 

'Actually, various types of user behavior other than rating exist, but in 
this paper, the term rate and rating are loosely used to indicate all behaviors 
showing user's preference, for example saving a bookmark, buying a product, 
or voting on a movie. 



variation of SSNR, namely a short-term decay, a long- 
term decay and a plateau between them. By consider- 
ing the corresponding time scales, we suggest different 
mechanisms to explain the short- and long-term decay 
respectively. 

- The proposal of the piecewise decay function. By drawing 
analogy with signal combining, we apply the Maximal 
Ratio Combining (MRC) method, and derive that the 
weights of ratings should be proportional to their SSNR. 
Thus, the observed time variation of SSNR provides a 
good reference for the decay function, and the three 
distinct phases naturally lead to a piecewise power decay 
function. Incorporating the decay function with CF leads 
to our proposed time-aware recommendation algorithm. 

To examine the recommendation quality, we test our pro- 
posed algorithm on the Delicious dataset, compared with 
conventional item-based CF and other time-aware algorithms 
with various decay functions. With hit-rate as the evaluation 
metric, remarkable improvements ranging from 11% to 63% 
are observed. 

II. Related Work 

Time-aware CF algorithms are highly promising in raising 
recommendation accuracy and timeliness. Among them, a kind 
of recency-based algorithms, which employ the time window 
or the decay function to emphasize recent ratings, are preferred 
for their simplicity and adaptability to drifting of user interest. 
Reference [1] adopted the exponential function to produce 
time-dependent weights for ratings. A more complex scheme 
was proposed in [3], which weighs ratings by their prediction 
accuracy for recent ratings instead of by timestamps. Further- 
more, similar ideas were also used in [2] and [4], incorporated 
with more complex user interest models. Besides the user 
interest drifting, studies in [2] [4] [5] tried to capture more 
temporal effects, but brought high computation burden at the 
same time. 

Besides in the time-aware CF, decay functions also play 
an important role in many applications. An coarse-grained 
discrete function is used in [6] to convert implicit ratings to 
multi-value ratings. In [7], the linear function is adopted to 
weigh examples for a content-based recommender system. The 



TABLE I 

Basic Information of the Delicious Data 



Users 


Items 


Ratings 


Sparsity 


Period 


14025 


318415 


1806951 


0.9996 


Jan.2004 - Aug.2007 



most widely used decay function, exponential function, is used 
in [8] for mining concept drifting data. 

III. Temporal Dynamics in Ratings Data 

A. Delicious Data 

The rating data examined in subsequent analyses is collected 
from a well known social bookmarking website, delicious.com. 
Ratings in the Delicious data are users' implicit feedbacks. 
When user a saved bookmark (item) i at time t a i, a rating 
occurred. To study the dynamics with a fine time scale, the 
timestamp t a i is recorded with precision in one second. 

After data collection, many bookmarks in the dataset are 
found to be saved by only one user. These bookmarks have 
no contribution in making recommendations, as they have 
no user overlap, and thus no quantified relation with all 
other bookmarks. Hence we remove them to avoid irrelevant 
computation. The attributes of the pre-processed dataset are 
summarized in Table Q] Note that the sparsity of the Delicious 
data is very high and poses a big challenge to recommendation 
algorithms [5]. As to preserve its original features and make 
closer correspondence between our study and the real world, 
we make no further modification on the data. 

B. Temporal Dynamics of Rating Impact 

Intuitively impacts of ratings should decay with the lapse 
of time. It implies that recent ratings are more relevant than 
old ratings to identify one's current favorite. Many time-aware 
CF algorithms are based on this deduction, but to the best of 
our knowledge, there is no concrete evidence or clear picture 
about how the decay goes with time. Limited knowledge on 
the decay process hinders the potential improvements of the 
algorithms. 

In this paper we address the problem of appropriate decay 
function by empirical analyses on a real dataset. Firstly, a 
quantitative measure is needed to quantify the rating impact 
on the current recommendation. One candidate is the cosine 
similarity [9], which is defined as 



_ / 2~ / 2~ ' 

where s« is the similarity between items i and j, and U 
corresponds to the set of users. For the Delicious data, we 
adopt a binary implicit rating and put r a % = 1 if item i is saved 
by user a as a bookmark, and otherwise r a % — 0. We further 
denote the current favorite item of user a as i* , such that the 
similarity between his/her saved item i and i* characterizes 
the impact of rating r a i. 

However, evaluating similarity is not sufficient to quantify 
rating impact. Consider a simple case with two rated items i 



and k by user a. For item i, su* — 0.8, but all Sij, where 
j ^ k and i* a , are larger than 0.9. For item k, s^* = 0.2, 
but all Skj, where j ^ i and i* a , are smaller than 0.1. Though 
Sii* > Sfci*, apparently item k is more important than item 
i to identify i* , the current favorite of user a. Therefore, the 
relative similarity with the current item is more relevant than 
the sheer similarity. 

To quantify the relative similarity with the favorite item, 
we draw analogy with signal and noise in signal processing. 
Specifically, su* a can be considered as the useful signal carried 
by the rated item i, and s.y, where j ^ i* a , can be considered as 
noise. By drawing analogy with the standard Signal-to-Noise- 
Ratio [10], we introduce the Similarity-Signal-to-Noise-Ratio 

SSNR a i = 5 "° T . (2) 

High SSNR a i implies item i has a strong impact on predict- 
ing the favorite item of user a. 

To investigate the decay of SSNR a i with the age of rating 
r a i, we conduct statistical analyses on the Delicious data. For 
each user a in the dataset, we leave his/her latest rating out, 
and consider the corresponding item as his/her current favorite. 
The latest rated item and its timestamp are thus denoted as i* a 
and t* . The SSNR of other rated items are then evaluated by 
equation (f2j) with this favorite item. For each rating of user 
a (except the latest one), a pair of (SSNR a i,age a i), where 
ag&ai = t* a — t a i, is obtained and reveals the relationship 
between rating impact and age. In this case, we will compute 
L— 1 pairs of (SSNR a i, age a i) for a user with L rated items, 
and the same computation is conducted for all users. 

In Fig. Q] we show SSNR a i as a function of age a i- We 
log-bin age a i and average SSNR a i over each bin. Fig. Q] 
is shown in log-log scale to illustrate the behaviors of small 
SSNRai or with small age a i- A trendline is added to outline 
the variation of SSNR. 
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Fig. 1. Rating SSNR as a function of age. A trendline (the solid line) is 
added to outline the variation of SSNR. 

Despite fluctuation is relatively strong, the SSNR curve 
shows three distinct phases. Specifically, short- and long-term 
decay are respectively observed within 10 4 seconds (w 3 



hours) and beyond 10 seconds (« 10 days), with a plateau 
connecting the two phases. SSNR a i, defined as the impact of 
rated item i on current recommendation, is also a measurement 
of how much interest of user a changes in time age a i. 
Therefore, Fig. Q] provides us a clear picture of user interest 
drifting over time. 

The corresponding time scales provide hints to explain 
the origin of the decays. As neither individual nor global 
preference would have great changes within 10 4 seconds [2], 
we attribute the short-term decay to the switch of users' 
focuses. As Fig. [T] is shown, the values of SSNR within 10 4 
are higher than the remains, which can indicate that users' 
short-term focuses are more correlated. But the short-term 
decay shows that users' focuses are drifting over time. By 
examining a dataset with precision in one second, for the first 
time we uncover the short-term decay in rating data. In past 
studies, these transient effects are regarded as interference 
to recommendations and suggested to be filtered out [2] [4], 
but we will show in following sections that they indeed 
have significant contributions in improving recommendation 
accuracy. 

The long-term decay has been widely discussed in related 
literature [l]-[4] [7], which reflects the change of user's intrin- 
sic interest. Our quantitative results show that the change does 
not occur in all time scales. It is only notable beyond about 
ten days, but within this time window, user interest almost 
stays the same. We suggest two main causes for the long- 
term decay. The first one is the attraction from new items, 
which constantly alters the hotspots of the society. The second 
one is the change of users' intrinsic characteristics, such as 
age, profession, social relationship, etc. These changes happen 
slowly and less often, but are very likely to affect users' 
preferences. 

IV. Collaborative Filtering with the Piecewise 
Decay Function 

Exponential decay was proposed long ago to characterize 
the decay of rating impact and is now widely used. It is based 
on human's forgetting curve, which implies that the ratings 
decrease in impacts as time goes on because people forget 
them. However, as discussed, the temporal rating behaviors 
root in reasons in addition to forgetting. In this section, we 
will derive an appropriate decay function to incorporate with 
the item-based CF, which constitute our proposed algorithm. 

In general, recommendations by time-aware item-based CF 
are based on 

= w ( a 9e a i)sij, (3) 

where f a j is the prediction score of item j for user a, I a 
denotes the set of all items rated by user a and w(t) is the 
decay function employed to weigh items with different ages 
[1][9]. For user a, all his/her unseen items are sorted by f a j 
in the descending order, and the top- A items are delivered as 
the recommendation results. 

In analogy with SSNR on similarity, we can also define 
SNR on the final predicted probabilities as fSNR a = 



fai* I J2k^i* fuk> where i* a is again the current favorite of 
user a. Obviously, good recommendation results should give 
item i* a high rank and hence a large fSNR a . We then 
have to assign appropriate weights, i.e. decay function, to 
maximize the outcoming fSNR. By drawing analogy with 
signal combing problem, where MRC is employed for the 
same purpose^, we obtain w{age a i) — SSNR a i. Based on 
the variation of SSNR a i with age a i in Fig. Q] we take the 
trendline of the curve to derive w(age a i). Mathematically, the 
decay function reads 

f (t:) K \ o<t<r s , 

w(t) = ! 1, T s <t<T h (4) 

{ (-fr)"*'. r,<t. 

Substituting © into (0, we propose a time-aware CF algo- 
rithm with the piecewise decay function. In equation ©, four 
free parameters, T s , Ti, K s and Ki, are introduced for fine 
tuning to achieve the optimal algorithmic performance. T s and 
Ti are respectively the time thresholds of short- and long- 
term decay, and K s and A) are the control parameters for the 
corresponding decay rate. 

V. Experiments 

A. Experiment Design 

We evaluate our algorithm on the Delicious data described 
in section IIII-AI As the task of recommender systems is to 
identify one's current favorite, we adopt the so-called leave- 
the-latest-out method for cross validation, rather than the 
traditional K-fold or leave-one-out method. The latest rating 
of each user, say user a, is left out as a probe when making 
recommendations for him/her, and all the other ratings serve 
as the training data. The timestamp t* of the latest rating is 
regarded as the time when the recommendation is made, and 
is used to calculate ratings' ages. The latest rated item i* a is 
the test item for evaluating the recommendation accuracy. 

In this paper, hit-rate is employed as the evaluation metric 
for recommendation accuracy [9]. The hit-rate for a specific 
search depth A is defined as 

1 v-^ h(i*N) 
H@N = — > — - (5) 
\U\ ^ A ' W 

where \U\ is the number of users, h(i* a: N) — 1 if item 
i* is top- A sorted by prediction scores, and h(i* a ,N) = 
otherwise. 

B. Decay Functions 

The proposed algorithm is simulated with the parameters 
K s e [0.1,1], Ki e [0.1,1], T s e [100, 1O 5 and T t G 

2 MRC is the optimal combiner for independent Additive-White-Gaussian- 
Noise channels. For other channel types, MRC is also widely adopted , 
because its basic idea of boosting the strong signal components and attenuating 
the weak components will surely improve performance when compared with 
Equal Gain Combing. 

3 In this paper, all values of time is in unit of one second. 



TABLE II 

Algorithms' Hit-Rate and Improvements 



TABLE III 
Optimal Parameters 



Algorithms 


H@10 

(xicr 3 ) 


Improvement 


H@20 
(xl(T 4 ) 


Improvement 


H@50 
(xl0~ 4 ) 


Improvement 


IBCF 


1.03 


6.84 




4.23 




WIN 


1.29 


t 25.2% 


8.23 


t 20.3% 


4.68 


t 10.6% 


LOG 


1.44 


t 39.8% 


8.71 


t 27.3% 


4.59 


t 8.5% 


EXP 


1.51 


t 46.6% 


9.00 


t 31.6% 


4.66 


t 10.2% 


OUTRADAY 


1.54 


t 49.5% 


9.74 


t 42.4% 


5.48 


t 29.6% 


Proposed 


1.68 


t 63.1% 


10.6 


t 55.0% 


5.76 


t 36.2% 



T w 


10' 


T 


30000 


T e 


50000 


K 


0.9 


T s 


50000 


Ti 


10 b 


K s 


0.6 


Ki 


0.3 



[5x 10 5 , 5x 10 7 ]. To demonstrate the benefits from temporal in- 
formation, we include the conventional Item-Base CF (IBCF) 
in our experiment. Besides, we also evaluate time-aware CF 
with the following decay functions for comparison. 
• WINdow function (WIN): 

' 1, t<T w 
0, t>T w ' 



w w {t) 



(6) 



where the free parameter T w £ [100, 10 8 ]. 
LOGistic function (LOG) [1]: 

1 

Wnit) = —7 -, 

BW 1 + exp(^- - b) ' 



(7) 



where the free parameter T g G [1, 10 8 ], and b is set to 5 
in our experiment. 
EXPonential function (EXP) [1]: 



w e (t) = exp(- 



- — ), 



where the free parameter T e S [1, 10 8 ]. 
OUTRADAY decay function (OUTRADAY): 

1, t < 86400 



w (t) 



( 86400 ) 



t > 86400 



(8) 



(9) 



where the free parameter K Q G [0.1,2]. The outraday 
decay function is similar to the proposed decay function, 
except that it ignores the short-term decay which happens 
within one day. 

C. Results 

In our experiment, for each algorithm we calculate many 
groups of H@10, H@20 and H@50 by tuning the free 
parameters in the given regions. Then, the results with optimal 
H@10 are selected and shown in Table [Til and at the same 
time the optimal parameters are given in Table [111] We also 
list the improvements of each algorithm when compared with 
the IBCF algorithm. Apparently, all time-aware algorithms 
strongly outperform the IBCF algorithm. Among them, the 
proposed algorithm achieves the best performance, and the 
improvements are significant. Remarkably, the difference in 
improvement between the proposed and OUTRADAY algo- 
rithm shows the great importance of short-term decay, as the 
two algorithms are identical except that the latter ignores the 
dynamics within one day. This again confirms the importance 
of the present study, as most past studies overlook the benefits 
of examining dynamics shorter than one day [2] [5]. 



VI. Conclusions and Discussion 

In this paper, we quantified user interest drifting by a novel 
quantitative measure by analogy with Signal-to-Noise-Ratio, 
and applied the findings on our time-aware CF algorithm by a 
carefully designed decay function. We uncovered and utilized 
the short-term decay shorter than one day, which is overlooked 
in the past studies. Experiments show our great algorithmic 
improvement compared with the present state-of-the-art. 

It is worth noting that, users' activities in the Internet are 
more bursty than what we observed from the Delicious data, 
where rich short-term information is ready to be utilized to 
improve recommendation. As a further example, we construct 
a semi-artificial dataset from the Delicious data, which em- 
phasizes the bursty behaviors. Experiments on this dataset 
demonstrate that an improvement of 110% is achieved by the 
proposed algorithm when compared with the IBCF algorithm. 
The results will be presented in details in an extended paper. 
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